Aerial Monocular 3D Object Detection
نویسندگان
چکیده
Drones equipped with cameras can significantly enhance human's ability to perceive the world because of their remarkable maneuverability in 3D space. Ironically, object detection for drones has always been conducted 2D image space, which fundamentally limits understand scenes. Furthermore, existing methods developed autonomous driving cannot be directly applied due lack deformation modeling, is essential distant aerial perspective sensitive distortion and small objects. To fill gap, this work proposes a dual-view system named DVDET achieve monocular both space physical address severe view issue, we propose novel trainable geo-deformable transformation module that properly warp information from drone's birds' eye (BEV). Compared cars, our includes learnable deformable network explicitly revising deviation. dataset challenge, new large-scale simulation AM3D-Sim, real-world AM3D-Real high-quality annotations detection. Extensive experiments show i) feasible; ii) model pre-trained on helps performance; iii) DVDET also cars. encourage more researchers investigate area, released related code.
منابع مشابه
Monocular Object Detection Using 3D Geometric Primitives
Multiview object detection methods achieve robustness in adverse imaging conditions by exploiting projective consistency across views. In this paper, we present an algorithm that achieves performance comparable to multiview methods from a single camera by employing geometric primitives as proxies for the true 3D shape of objects, such as pedestrians or vehicles. Our key insight is that for a ca...
متن کامل3d Object Reconstruction from Aerial Stereo Images
Among the variety of problems in providing 3D information about topographic objects, efficient data collection and model construction are issues for research. The efforts are directed towards improving the tedious, time and man-power consuming process of data generation by applying automation. In this paper, we present a semi-automatic method for acquiring 3D topologically structured data from ...
متن کاملMonocular Vision-Based Underwater Object Detection
In this paper, we propose an underwater object detection method using monocular vision sensors. In addition to commonly used visual features such as color and intensity, we investigate the potential of underwater object detection using light transmission information. The global contrast of various features is used to initially identify the region of interest (ROI), which is then filtered by the...
متن کاملRobust 3D Object Tracking from Monocular Images using Stable Parts.
We present an algorithm for estimating the pose of a rigid object in real-time under challenging conditions. Our method effectively handles poorly textured objects in cluttered, changing environments, even when their appearance is corrupted by large occlusions, and it relies on grayscale images to handle metallic environments on which depth cameras would fail. As a result, our method is suitabl...
متن کاملMonocular Multiview Object Tracking with 3D Aspect Parts
In this work, we focus on the problem of tracking objects under significant viewpoint variations, which poses a big challenge to traditional object tracking methods. We propose a novel method to track an object and estimate its continuous pose and part locations under severe viewpoint change. In order to handle the change in topological appearance introduced by viewpoint transformations, we rep...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE robotics and automation letters
سال: 2023
ISSN: ['2377-3766']
DOI: https://doi.org/10.1109/lra.2023.3245421